Standard for morphosyntactic and syntactic corpus annotation: The Morphosyntactic and the Syntactic Annotation Framework, MAF and SynAF
Abstract
This talk covers two standards for corpus annotation: MAF (Morpho-syntactic Annotation Framework, ISO/FDIS 24611) for morpho-syntactic annotation and SynAF (Syntactic Annotation Framework, ISO 24615:2010) for syntactic annotation. The two standards are closely related and complement each other. MAF describes properties of individual words, such as part of speech and morphological and grammatical features, whereas SynAF describes the relations between words and how words are arranged and connected to build phrases and sentences. The talk presents an overview of the current state of both standards.
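The division of labour described in the abstract can be illustrated with a minimal sketch. It is not the data model or serialisation defined by either ISO standard; the class names `WordForm` and `Dependency`, the feature keys, and the relation label are all hypothetical and chosen only to show word-level features on one side and relations between words on the other.

```python
from dataclasses import dataclass, field


@dataclass
class WordForm:
    """Word-level annotation in the spirit of MAF (hypothetical structure)."""
    form: str
    lemma: str
    pos: str                                       # part of speech
    features: dict = field(default_factory=dict)   # morphological features


@dataclass
class Dependency:
    """Relation between two words in the spirit of SynAF (hypothetical structure)."""
    head: int        # index of the governing word
    dependent: int   # index of the dependent word
    relation: str    # illustrative label, not taken from the standard


# Word-level layer: each word carries its own features.
words = [
    WordForm("dogs", "dog", "NOUN", {"number": "plural"}),
    WordForm("bark", "bark", "VERB", {"tense": "present"}),
]

# Syntactic layer: edges refer to words by index and say how they connect.
dependencies = [Dependency(head=1, dependent=0, relation="subject")]


def describe(dep: Dependency, words: list) -> str:
    """Render one relation as 'dependent --relation--> head'."""
    return f"{words[dep.dependent].form} --{dep.relation}--> {words[dep.head].form}"


summary = describe(dependencies[0], words)
print(summary)
```

Keeping the two layers separate, with the syntactic layer only pointing at word-level units, mirrors the way the abstract presents the two standards as complementary.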
Related resources
An annotation scheme for Persian based on Autonomous Phrases Theory and Universal Dependencies
A treebank is a corpus with linguistic annotations above the level of the parts of speech. During the first half of the present decade, three treebanks have been developed for Persian either originally or subsequently based on dependency grammar: Persian Treebank (PerTreeBank), Persian Syntactic Dependency Treebank, and Uppsala Persian Dependency Treebank (UPDT). The syntactic analysis of a sen...
Serialising the ISO SynAF Syntactic Object Model
This paper introduces <tiger2/>, an XML format developed to serialise the object model defined by the ISO Syntactic Annotation Framework SynAF. Based on widespread best practices, we adapt a popular XML format for syntactic annotation, TigerXML, with additional features to support a variety of syntactic phenomena including constituent and dependency structures, binding, and different node types ...
<tiger2/>: serialising the ISO SynAF syntactic object model
This paper introduces <tiger2/>, an XML format developed to serialise the object model defined by the ISO Syntactic Annotation Framework SynAF. Based on widespread best practices, we adapt a popular XML format for syntactic annotations, TigerXML, with additional features to support a variety of syntactic phenomena including constituent and dependency structures, binding, and different node type...
The annotation of the C-ORAL-BRASIL spoken corpus using an adaptation of the Palavras Parser
This article describes the morphosyntactic annotation of the C-ORAL-BRASIL speech corpus, using an adapted version of the Palavras parser. In order to achieve compatibility with annotation rules designed for standard written Portuguese, transcribed words were orthographically normalized, and the parsing lexicon augmented with speech-specific material, phonetically spelled abbreviations etc. Usi...
A Framework for Standardized Syntactic Annotation
In this poster we present current work on building a standard for syntactic annotation in the framework of ISO TC37/SC4. We mainly describe the meta-model for syntactic annotation, which builds on the current ISO proposal for a standard for morpho-syntactic annotation (MAF) and which is embedded in ongoing efforts for defining a generic linguistic annotation